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Abstract 

As an increasing number of ecosystems face departures from long standing environmental conditions under climate 
change, our understanding of the capacity of species to adapt will become important for directing conservation and 
management of biodiversity. Insights into the potential for genetic adaptation might be gained by assessing genomic 
signatures of adaptation to historic or prevailing environmental conditions. The river red gum {Eucalyptus camaldulensis 
Dehnh.) is a widespread Australian eucalypt inhabiting riverine and floodplain habitats which spans strong environmental 
gradients. We investigated the effects of adaptation to environment on population level genetic diversity of E. 
camaldulensis, examining SNP variation in candidate gene loci sampled across 20 climatically diverse populations 
approximating the species natural distribution. Genetic differentiation among populations was high (Fst= 17%), exceeding 
previous estimates based on neutral markers. Complementary statistical approaches identified 6 SNP loci in four genes 
(COIVIT, Dehydrin, ERECTA and PIP2) which, after accounting for demographic effects, exhibited higher than expected levels 
of genetic differentiation among populations and whose allelic variation was associated with local environment. While this 
study employs but a small proportion of available diversity in the eucalyptus genome, it draws our attention to the potential 
for application of wide spread eucalypt species to test adaptive hypotheses. 
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introduction 

Trees are foundation species in many terrestrial ecosystems and 
changes to local environment associated with natural and 
anthropogenic climate change are projected to impact, or are 
already impacting, the health of forest tree populations and the 
ecosystems they service worldwide [1-5]. As a greater number of 
species confront significant environmental change, it is becoming 
important to understand the factors influencing the capacity of 
populations to adapt and to monitor these [6-8]. Forest 
populations commonly exhibit evolved mechanisms to cope with 
prevailing environmental conditions, evident from the character- 
isation of adaptive phenotypic and genetic diversity in environ- 
mentally contrasted populations [9-11]. However in order to 
persist populations must retain the ability to adapt when 
conditions change [12]. Broadly speaking, plant populations adapt 
to environmental change through a combination of mechanisms 
including reversible changes to their physiological or morpholog- 
ical phenotypes independent of genotype (e.g. phenotypic plastic- 
ity); adaptation of phenotypes via changes in allelic composition as 
a result of selection (e.g. genetic adaptation); and migration to 



more suitable environments (e.g. seed or poUen dispersal). In some 
forest trees these responses will be insufficient to track rapid 
climate redistribution [5,13], depending on the rate and magnitide 
of enviroimnetal change, physiological tolerances, life-history 
strategies (e.g. generation time), dispersal abilities, population 
dynamics, interspecific competition and levels of genetic diversity 
[6,13,14]. 

Reversible plastic responses are important for short term 
adaptation of natural populations [15], but in the long term, 
permanent adaptations reflecting locally prescribed changes in the 
underlying genetics of adaptively important traits will be required 
to preserve population fitness. The rate at which genetic 
adaptation can occur, with respect to the rate of environmental 
change, will be key to the persistence of many species impacted by 
climate change [13,16,17]. The capacity for genetic adaptation 
depends on multiple factors including the level of pre-existing or 
standing genetic diversity, effective population size, strength of 
selection and life history traits such as generation time and 
fecundity [15,18,19]. Consequently rates of genetic adaptation are 
likely to be highly variable among species and populations. 
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Insights into a species potential for genetic adaptation could be 
gained through characterisation of the relative abundance of 
adaptive versus neutral variation in response to historic or 
prevailing environmental conditions, which might be applied to 
empirically test hypotheses about environmental adaptation. 
Forest trees are tractable models for adaptive genetic studies 
owing to widespread populations traversing environmental gradi- 
ents, high levels of diversity and low frequency of co-segregation 
among gene loci (linkage disequilibrium) [10]. For tree species of 
commercial importance the availability of common gardens has 
revealed strong adapti\'e clines in wood, growth and phenology 
traits [9,13,20]. Population and landscape genetic studies have 
recently suggested abundant adaptive genetic variation underlying 
these differences [21-33]. Plausible links between adaptive 
genotypes and phenotypic variation give further insight into the 
biological basis for adaptive genetic variation in sevearl cases 
[10,25]. 

Significant areas of forest and woodland growing in marginal or 
semi-arid regions of Australia are currently at risk of decline in 
response to climate change. The river red gum {Eucalyptus 
camaldulensis Dehnh.) is a large tree found in riparian zones and 
associated floodplains in arid and semi-arid regions throughout 
Australia [34] . River red gum depend on flooding for recruitment, 
but adults can withstand prolonged periods of drought. Changes to 
watering regime, resulting from decreased flood frequency as a 
result of both river regulation and prolonged drought (hotter, drier 
conditions) in south-eastern Australia, have caused populations of 
river red gum within the Murray-Darling Basin to decline [35-37]. 
The extent of genetic adaptation among widely distributed 
populations could inform our understanding of adaptation to 
climate in this species, as well as reveal candidate gene loci that 
may be important as genetic markers to assess adaptive potential 
and guide conserx^ation. 

Across the Australian continent E. camaldulensis traverses 
strong environmental gradients and has likely evolved mechanisms 
to cope with variation in water availability. Variation in 
morphology between provenances suggests populations may be 
locally adapted [38,39]. Genetic diversity and evidence of genetic 
adaptation have also been assessed. A survey of genetic diversity 
based on microsatellite loci revealed high levels of genetic diversity 
that exhibits geographically defined structure [40]. Genetic 
difierentiation among populations was correlated with environ- 
ment, however this relationship was attributed to historical, 
demographic factors rather than selection. DNA sequence 
variation in E. camaldulensis revealed high levels of nucleotide 
diversity in genes within secondary metabolite biosynthetic 
pathways [41], and ratios of non-synonymous to synonymous 
polymorphism imphed positive selection. Thumma et al. [42] 
performed whole transcriptome profiling to identify genes that 
may be important to drought response. Sequence analyses 
revealed high ratios of non-synonymous to synonymous polymor- 
phism in nearly 300 genes which were identified as putative targets 
of positive selection, with a third of these being differentially 
expressed between drought treatments. A recent study of adaptive 
variation at the whole genome scale in E. camaldulensis sampled 
from four sites in northern Australia revealed signatures of 
adaptation based on nucleotide sequence level tests [33]. Nearly 
2000 SNP loci were identified whose alleles were differentiated 
between pairs of environmentally contrasted sampling locations. 

Given the broad geographic and environmental distribution of 
E. camaldulensis it is desirable survey patterns of diversity at 
genetic loci that could be targets of selection, e.g. coding genes, in 
population samples spanning the natural range of the species. The 
earlier microsateUte study extensively sampled the natural 



populations, however the marker systc-m applied was not suitable 
for studies of adaptation. Conversely, studies of genetic adaptation 
have targeted only a narrow subset of the available population 
diversity or did not allow comparison among populations. In this 
study we examined genetic diversity and divergence of 59 SNP 
markers sampled from twelve candidate gene loci in 20 
populations of E. camaldulensis distributed across the species 
natural range. Several tests for evidence of genetic adaptation were 
performed upon individual SNPs as well as set of SNPs 
representing whole genes, and correlations with environmental 
j)aramct('rs \vere investigatc'd \'ia association studies. The results 
suggest selec:tion has driven diversity among populations for some 
genes and highlights the amenability of this species for further 
landscape level studies of adaptation employing larger numbers of 
individuals, populations and SNP loci. 

Methods 

Populations 

Ten trees were sampled per population across 20 populations 
spanning the natural distribution of £. camaldulensis, representing 
a subset of the collection previously published by Butcher et al. 

[40] (Table 1; Figure 1). Individual tree DNA samples were 
archived at the CSIRO Plant Industry laboratories, Canberra, and 
were used with permission. In total, 2 \lg of diploid genomic DNA 
previously extracted from leaves using a modified CTAB protocol 
[43] was further purified on QIAGEN QIAquick PGR purification 
columns ac:c:ording to the manufacturer's instructions. Data for 15 
microsatellite (SSR) loci previously generated by Butcher et al. 
[40] in the same 20 populations (with the exception of Wirrengren 
Plain) was made available and included in downstream analyses, 
serving as a benchmark for neutral, demographic effects on 
population genetic diversity. 

Environmental data 

The populations sampled extend across the natural range of 
river red gum Australia wide, including six subspecies, traversing 
broad environmental gradients from central Australia to the wet 
tropics. Twenty one parameters reflecting variation in environ- 
ment at each site were extracted from the Atlas of Living Australia 
website at http://w\™'.ala.org.au, accessed 24 June 2013 (Table 
SI). To reduce redundancy in the data set the total number of 
environmental variables was reduced to a set of six multivariate 
traits foflowing principle component analyses (PCA) implemented 
in the package StatistiXL (Table 1). PCA was performed 
separately on sets of variables grouped broadly into three 
environmental classes: geography, climate and ecology. Where 
more than one principal component was identified in each class, 
those with an Eigen value S 1 and which brought the cumulative 
variance to S50% were selected. Component loadings describing 
the contribution of each environmental variable to the reduced 
component variables for each class are presented in Table S2. 
Each of the reduced environmental traits exhibited a strong 
relationship with latitude (Figure SI). 

Candidate genes, SNP selection and genotyping 

Using single nucleotide polymorjihisms (SNPs) we examined the 
variability of twelve genes in trees sampled across the natural 
range of E. camaldulensis. The candidate genes applied in this 
study were grouped into two functional categories: 1. plant water 
relations (PIP2, Dehydrin and ERECTA), and 2. xylem ceU wall 
development (CAD, CCR, CESAl, CESA3, COBL4, COMTl, 
Korrigan, MYB4 and bZIP). The first group have the potential to 
impact plant water use in response to climate, via directing the 
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Figure 1. Location of £ camaldulensis popu\ations sampled at 20 sites across mainland Australia. Occurrence records off. camaldulensis 
downloaded from the Atlas of Living Australia (small circles) approximate the distribution of this species, which does not occur in Tasmania. 
doi:1 0.1 371 /journal.pone.01 0351 S.gOOl 



movement of water between cellular compartments and out of the 
leaf stomata, or acting as internal cellular stabilisers in response to 
dehydration [44—46]. The second group moderate the physical 
properties of the vascular architecture and wood, which are shown 
to be important adaptive traits that can indirectly influence plant 
responses, for example drought [47]. Thus genetic diversity in 
both sets of genes could potentially reflect adaptation to 
environment. 

Polymorphism data was identified from eleven candidate genes 
for which amplicons representing the open reading frame had 
previously been sequenced on a pooled sample of 500 E. 
camaldulensis individuals drawn from Murray-Darling Basin 
populations in the collection of Butcher et al. [40], using the 
454 high throughput DNA sequencing platform (Roche) (unpub- 
lished data. Table 2). Sequences were trimmed and aligned 
against fuU length reference sequences obtained from the 
Eucalyptus grandis genome sequence (Phytozome: Eucalyptus 



grandis Genome Project 2010; http://www.phytozome.net/ 
eucalyptus) in CLC Genomics Workbench (CLCbio). A twelfth 
gene, ERECTA, was sequenced in 36 individuals drawn from four 
Murray-Darling Basin populations of E. camaldulensis (Yanga 
National Park (12), Menindee (8), Wenthworth (8) and WUcannia 
(8)), which were sampled from the collection of Butcher et al. [40] 
with the exception of Yanga which was sampled from a separate 
collection held at the CSIRO Plant Industry laboratories, 
Canberra. A 1000 base pair fragment of the ERECTA gene was 
amplified by PCR as above using primers designed from the 
coding regions. PCR products were sequenced by Macrogen Pty 
Ltd. Alignment and editing of the sequence data was performed 
using BioEdit [48]. 

Single nucleotide polymorphisms were selected from DNA 
sequence alignments based on the following criteria: a base pair 
change was present, the base position was not an indel and the 
minor SNP allele occurred in at least three individuals. SNPs were 
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Table 2. Candidate genes examined across £ camaldulensis populations. 



abbreviated name 


gene name 


£ grandis homologue* 


length** 


putative function 


CAD 


Cinnamyl-Alcohol Dehydrogenase 


Eucgr.G01350 


900 


lignin biosynthesis, [96] 


CCR 


Cinnamoyl-CoA Reductase 


Eucgr.J03114 


6000 


lignin biosynthesis, [96] 


CesAl 


Cellulose Synthase 1 


Eucgr.D00476 


7000 


cellulose biosynthesis, [97] 


CesA3 


Cellulose Synthase 3 


Eucgr.C00246 


7000 


cellulose biosynthesis, [97] 


C0BL4 


C0BRA4 like gene 


Eucgr.J01392 


3000 


cellulose biosynthesis, [98] 


COMT 


Caffeate 3-0-methyltransferase 1 


Eucgr.A01397 


2000 


lignin biosynthesis, [96] 


Dehydrin 


Dehydrin like protein 


Eucgr.100186 


1000 


water stress response, [99] 


ERECTA 


Erecta leucine rich repeat protein 


Eucgr.C0073 


4500 


water use efficiency, [45] 


Korrigan 


Korrigan (Endo-1,4-p-Glucanase) 


Eucgr.G00035 


2500 


cell wall expansion, [100] 


MYB4 


MYB4 Transcription Factor 


Eucgr.G03385 


1800 


lignin biosynthesis [101] 


bZIP 


bZip Transcription Factor 


Eucgr.F01867 


9000 


lignin biosynthesis, [102] 


PIP2 


Plasma Membrane Intrinsic Protein 


Eucgr.D02548 


3500 


water stress response, [44] 



*gene ID from annotated £ grandis genome sequence (www.phytozome, 
**length of sequenced gene region in base pairs. 
doi:l 0.1 371 /journal.pone.Ol 0351 5.t002 



net). 



selected to maximise coverage across the gene and avoid 
redundancy due to linkage disequilibrium, by selecting a SNP at 
least one to two hundred base pairs apart. Gene sequences were 
annotated, and for each SNP the base change, gene position 
(intron/ exon) and amino acid substitution were recorded. Ten 
SNPs were chosen for each gene, with the exception of CAD (8), 
bZIP (7), Dehydrin (5) and Erecta (5). SNP genotyping was 
performed on genomic DNA samples using the Sequenom 
MassARRAY System at the Australian Research Genome Facility 
(AGRF) (Table S3). Genotype calls for all SNP loci across aU 
individuals are provided in file Table S4. 

Linkage disequilibrium (LD) in outcrossing forest tree species is 
typically low, and in eucalypts such as river red gum co- 
segregation of SNP loci along a gene decays within several 
hundred base pairs [41]. Consequently SNP markers typed in 
different genes might be expected to segregate independently. To 
test this, pairwise LD among genotyped loci was assessed using the 
program Tassel [49]. 

Genetic diversity and divergence 

In total, 76 of the 105 selected SNPs were successfully 
genotyped. Monomorphic SNPs and trees with >20% missing 
data were omitted from the data set, resulting in 59 SNPs typed 
across 191 individuals. Observed and expected heterozygosity and 
tests for Hardy-Weinberg Equilibrium (HWE) were performed on 
individual SNPs, and for sets of SNPs representing whole genes, 
for each population separately and overall using GenAlEx 6.2 
[50]. 

Genetic divergence among populations based on all SNP loci 
was investigated using several complementary approaches. Firstly, 
analysis of molecular variance (AMOVA) was performed in 
GenAlEx to partition genetic diversity residing within and among 
populations and individuals. Using the AMOVA framework Fgx 
was also estimated in GenAlEx to obtain both overall and pairwise 
population estimates of genetic differentiation. Significance of the 
observed differentiation was tested by performing 1000 random 
permutations of the data. In the same way. Est was also estimated 
for these populations (with the exception of Wirrengren Plain) 
using polymorphism data for the 15 putatively neutral microsat- 
eUite loci (nuSSR) apphed in Butcher et al. [40] . 



To compare patterns of genetic divergence inferred from the 
different marker systems correlations between matrices of pairwise 
population Fgx were examined via a Mantel test implemented in 
GenAlEx, and significance was based on 1 000 permutations of the 
data. Divergence was also compared via principal coordinate 
analysis (PCoA) of genetic differentiation among populations 
implemented in GenAlEx. Spatial autocorrelation (isolation by 
distance or IBD) of population pairwise genetic divergence based 
on SNP loci was assessed using the Mantel function in GenAlEx, 
and significance was assessed on 1000 permutations. The overall 
pattern of genetic structure reflected in the SNP dataset was 
summarised as a reduced set of orthogonal axes following principal 
component analyses (PCA) in the package StatistiXL. The first 20 
principal components, which had an Eigen value &1, and 
cumulatively accounted for &50% of the genotypic variance in 
the SNP data set, were used to describe genetic structure in 
association tests. 

Detection of adaptive genetic variation 

In this study we investigate signatures of genetic adaptation in 
candidate genes that could underlie adaptive variation in 
environmentally contrasted populations of E. camaldulensis. 
Genetic structure in this species will potentially confound tests 
aimed to detect adaptation based on co-variation of genetic 
markers and environmental parameters, in light of the fact that 
neutral genetic variation and environment are autocorrelated 
among populations [40] . This feature arises from the tendency for 
genetically related populations to be geographically proximate 
(isolation by distance), and that proximate populations tend to 
share similar environments. Considering this we apply a conser- 
vative approach, relying on significance in multiple complemen- 
tary tests, while accounting for neutral patterns of variation to infer 
adaptive signatures. 

FsT outlier tests. The SNP markers employed in this study 
reside within functional genes, and their allele frequencies may be 
subject to locus-specific effects such as selection. One approach to 
detect such effects is to assess observed differentiation (Fst) at 
individual loci (genes or SNPs) with respect to a neutral model 
[51]. Two methods (one Bayesian and one coalescent) were 
applied to test SNP marker differentiation against alternative 
neutral models. 
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Firstly, a neutral distribution of Fgx conditioned on heterozy- 
gosity (Hj.) ranging from 0 to 1 was generated via coalescent 
simulation over 40000 loci applying a Hierarchical Island Model 
that accounts for user defined population structure [52]. Genetic 
differentiation and hierarchical structure (defined by 100 demes 
and 5 nested groups) of the simulated data set was based on the 15 
putatively neutral microsatelite loci (nuSSR) previously applied by 
Butcher at al. [40] . The distribution of simulated loci formed the 
basis of the neutral envelope against which Fgx for SNP loci were 
subsequently tested. The mean, 97.5'l^i (upper) and 2.5% (lower) 
confidence intervals for the simulated Fsi distribution were 
calculated using 'cplot', which is distributed with the Fdist2 
package [53]. Estimates of F^t and He for 59 SNP loci were 
generated using 'datacal', also distributed with Fdist2. Estimates of 
FsT and H,, averaged over whole genes were generated using 
GenAlEx. Significance of marker F^i was tested using the 'pv' 
program distributed with Fdist2. Taking a conservative approach, 
markers which exceeded the upper or lower confidence intervals 
(P<0.01) were classed as outlier loci. Additional oudiers were also 
considered between 0.01<P<0.05. To account for multiple 
testing the false discovery rate (FDR) and q-values for each locus 
was estimated [54]. 

A second approach based on the multinomical-Dirichlet model 
was applied to identify SNP loci that may be under selection using 
the program BayeScan [55]. This method does not require prior 
knowledge of neutral population differentiation and in contrast to 
the island model can consider realistic ecological scenarios where 
size and migration rate differ among populations. Rather than 
comparing sites and testing for outiiers, BayeScan estimates the 
probability of a locus being under selection for two models, one 
that includes the elfect of selection and another that excludes 
selection, using a reversible jump Markov chain Monte-Carlo 
approach. The default parameters for the Markov chain were used 
(20 pilot runs of 5000 iterations, 100,000 total iterations), and the 
program was run twice to check reproducibility. Significance of 
SNP FsT was interpreted from posterior odds using Jeffrey's scale 
[56]. 

Environmental associations. The spatial analysis method 
(SAM) of Joost et al. [57] was applied to detect individual SNP loci 
that may be locally adapted based on association with environ- 
mental variables. This test performed logistic regression of binary 
allele frequency data for all SNPs and environment on individual 
trees, where the significance of regression for each locus was tested 
based on p-values for two statistical tests a) likelihood ratio or G 
statistic, and b) the Wald statistic. Statistical significance in both 
tests was determined after applying the Bonferroni correction. The 
method aims to be conservative by calling an association based on 
significance in both tests. Allele scores for 59 SNPs scored across 
191 trees were first converted to binary information (118 allele 
markers), where each allele was scored as a single locus. 
Component environmental variables, latitude and longitude for 
each individual tree were recorded in the same data frame. 

Tests of association between allelic variation and environment 
were also performed while attempting to account for potentially 
confounding population genetic structure using two alternative 
approaches. Firsdy associations between 6 component traits and 
59 SNP loci were tested across 191 individuals via a least-squares 
frxed effect general linear model implemented in Tassel [49] . The 
statistical model is described by y = XP + e, where y is a vector for 
the observed dependent variable (environment), P is a vector 
containing independent fixed effects, including genetic marker and 
population structure matrices, X is the known design matrix, and e 
is the unobserved vector for the random residual (error) [58]. 
Significant divergence has been detected between populations of 



E. carmUulensis [40], consequently a matrix of individual scores 
for 20 principal components derived from the SNP data set 
describing genetic structure was incorporated in the model. P- 
values were corrected for experiment wise error following 1000 
permutations of the data. 

The second method was used to validate environmental 
associations at six outlier loci identified following analyses with 
both MatSAM and Tassel. This approach utilised a Bayesian 
framework to test correlations between allele frequencies at these 
loci and six component environmental variables against a null 
model specified by the covariance structure of allele frequencies 
across populations [59]. This was achieved by first generating a 
covariance matrix of adlele frequencies among populations via a 
Monte Carlo Markov chain. Putatively neutral makers should be 
used for this step, thus we applied the entire SNP data set 
excluding the six putatively adaptive loci presented in Table 3. 
Default parameters for the Markov chain were used (20 pilot runs 
of 5000 iterations, 100,000 total iterations). The posterior of the 
covariance matrix was then applied as the nuU model to investigate 
whether allele frequencies for our loci of interest are correlated 
with environment using a Bayesian framework. Evidence for 
correlations between environment and SNP loci was interpreted 
based on Jeffrey's scale [56]. 

Isolation by adaptation. Genetic relationships estimated 
upon loci that have been targets of local adaptation might be 
expected to reflect differences in local selection pressure between 
populations [60]. Tests for covariance of pairwise matrices 
capturing genetic and environmental dissimilarity, such as the 
Mantel test, can be applied to examine relationships between 
environment and population genetic differentiation (isolation by 
environment) to identify potentially adaptive structure. However 
in performing such tests there is a need to isolate the effects of 
environment from those of population history on genetic 
differentiation, by incorporating information on patterns of 
divergence estimated from known neutral markers (i.e. nuSSR). 

Partial Mantel tests were subsequendy performed in the R 
package Ecodist [61], to test for covariance between pairwise 
population genetic differentiation (Fs i) at SNP loci identified as 
possible targets of diversifying selection (Table 3) and pairwise 
dissimilarity for component environmental variables (Table 1). 
Pairwise population Fgx was estimated independendy for the six 
SNP loci listed in Table 3 using Alrequin ver. 3.5 [62]. Prior to 
performing tests, linearity of the relationship between matrices was 
confirmed by viewing correlograms generated using the 
"pmgram" function (divided into 12 genetic distance bins) 
following Goslee et al. [61]. Using the "distance" function, 
component environmental parameters were converted into a 
matrix of population pairwise environmental Euclidean distances. 
Correlations between matrices of genetic differentiation for each of 
the putatively adaptive SNP loci and dissimilarity for each of six 
component environmental variables were examined using the 
"mantel" function. This was performed both with and without 
partiaUing out demographic effects on genetic differentiation by 
including a third matrix of pairwise population genetic differen- 
tiation estimated from 15 nuSSR loci. Significance was tested with 
1000 permutations of the data in each case. 

Results 

Genetic diversity and divergence 

SNP diversity generally conformed to Hardy Weinberg 
expectations within populations (55 out of 59 loci). When analysed 
as a combined sample, 76 percent of SNPs departed expectations, 
reflecting a Wahlund efiect due to population structure. When 
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estimated over all SNP loci, genetic diversity was moderate 
(He = 0.22) ranging from 0.19 at Normanby River (QLD) to 0.26 
at Fortescue (WA). The overall level of LD between SNPs was low 
with only 0.6% of pairwise correlations between sites (R^) 
exceeding 0.2. This indicates that the majority of loci screened 
segregated independently across the 191 individuals sampled. 
Genetic differentiation (Fst) estimated on all SNPs among 
populations (Fsx = 0.17, P<0.01) was greater than the estimate 
based on 15 nuSSR loci in the same populations (excluding 
Wirrengren plain) (Fgx = 0.08) [40] . Further breakdown of genetic 
variation via analysis of molecular variance (AMOVA) revealed 
that 60% of the genetic variance among populations based on 
SNP loci occurred within individuals, while 23% was attributed to 
variation among individuals, and 17% among populations. 

Pairwise population divergence estimated from SNP and SSR 
loci were moderately correlated (R^ = 0.27, p<0.001) ( Figure S2), 
suggesting that while overall divergence was higher among SNP 
loci, the hierarchical relationships among populations were 
similar. Genetic relationships among populations, inferred from 
principal coordinate analysis (PCoA), showed clear geographic 
trends (grouping of populations by subspecies) and also point to 
similarity between SNP and SSR divergence at the level of 
subspecies (Figure S3). Differences were also evident, for example, 
populations belonging to the subspecies arida (central Australia) 
and refulgens (pilbara) had greater affinity to subspecies minima 
when relationships were inferred from SNP markers than nuSSRs. 
Isolation by distance (IBD) was found to be significant when 
divergence at SNP loci among population pairs was compared 
with physical distance separating populations (R^ = 0.36, p<0.001) 
(Figure S4). This result suggests that the distance over which pollen 
and seed are dispersed is likely to be a major determinant of gene 
flow in this species. Significant IBD was also identified by Butcher 
et al. [40] among populations of E. camaldulensis based on SSR 
loci. 

Detection of adaptive genetic variation 

FsT outlier tests. To examine whether variation in SNP 

allelic diversity among populations could be explained by 
selection, differentiation (Fsi) at candidate loci (whole genes and 
SNPs) was compared to a neutral Fs i distribution simulated under 
a hierarchical island model [52]. Fgx oudier tests identified ten 
SNPs (from: PIP2 (4 SNPs), Dehydrin (3 SNPs), Erecta (1 SNP), 
COBL4 (1 SNP) and COMTl (1 SNP)), as well as three genes 
(PIP2, Dehydrin and COMTl), whose divergence was greater 
than expected under neutrality (p<0.01), and whose pattern of 
allelic diversity among populations might be accounted for by 
diversifying selection (Figure 2; Tahk; S5). We also identified three 
SNPs and one gene (ERECTA) which were significant at the less 
conservative threshold of p<0.05. Oudier loci accounted for 22% 
of all tests performed, exceeding the experiment wise error rate (a) 
of 5%, suggesting a low false positive rate overall. This is supported 
by significance of the FDR statistic, or q-value, for outlier loci 
which ranged between 0 and 0.05 (Table S5). 

Oudier testing using an alternative Bayesian analyses identified 
tiiree loci (from: Erecta (1 SNP) and PIP2 (2 SNPs)) where 
evidence in support of the model including diversifying selection 
was substantial (LogioBayes factor >0.5<1) to very strong 
(LogioBayes factor >1<2) (Table S6). Two additional loci from 
Dehydrin showed weak evidence for the adaptive model 
(LogioBayes factor >0<0.5) based on Jeffrey's scale. Importantly, 
each of these loci were also identified by the first method 
employing coalescent simulations. BayeScan is expected to be a 
more conservative test for outiiers than other methods, as it allows 
for deviations from a simple island model. Model averaging from 



the posterior distribution in BayeScan also tended to underesti- 
mate SNP differentiation (Egx) compared to the method-of- 
moment estimates from allele frequency variance components in 
Fdist2 and GenAlEx. 

The adaptive hypothesis suggested for several SNP loci 
following outlier tests was further supported by covariance of 
heterozygosity (He) and environment among populations. Popu- 
lation level heterozygosity estimated from a reduced data set 
containing outlier SNP loci only was significantly correlated with 
environment (e.g. glimPCAI (R^ = 0.48, p<0.001), ecolPCAl 
(R2 = 0.30, p<0.01,5) and geogPCA2 (R^ = 0.45, p<0.002)) 
(Figure 3). Covariance of He and environment was also observed 
at the level of individual oudier genes, where average He for PIP2, 
Dehydrin and COMT was positively correlated with climPCAI 
(Figure S5). 

Association of allelic and environmental variation. To 

investigate further the adaptive evolutionary model suggested for 
some SNP loci following outiier tests, we looked for evidence of 
variation in selective constraints among populations that could 
explain allelic diversity. Tests of association between allelic 
variation and component environmental variables were performed 
using several alternative approaches. Firstiy, logistic regression of 
allelic variation and component environmental variables was 
performed using MatSAM. This revealed associations between 
environment and allele frequency that were significant for both the 
LRT and Wald test following Bonferoni correction for number of 
loci, and supported a hypothesis of adaptive evolution at eleven 
outiier loci (Table S7). However the method implemented in 
MatSAM does not account for the presence of population 
structure, previously detected among river red gum populations, 
and may be prone to identifying false positive associations [63] . 

To correct for population structure SNP-environment associa- 
tions were tested via two further approaches, a general linear 
model (GLM) implemented in the program Tassel to test for 
covariance of genotype and environmental scores across individ- 
uals, [49], and a Bayesian model to test for covariance of allele 
frequencies and environment across populations [59]. Population 
structure was accounted for in the GLM and Bayesian models as 
either a matrix of component environmental scores, or a SNP 
covariance matrix estimated via a MCM chain respectively. In 
total, six SNP loci previously identified as Fgx outiiers were 
significantiy associated with one or more environmental \ ariable 
under the GLM (Table 3, Table SB and Figure S6). The inclusion 
of population structure in this model reduced the number of 
associations by 70%, suggesting that without correction the false 
positive rate due to neutral structure is high. Estimated SNP effect 
sizes (R^ or the proportion of environmental variance explained by 
the SNP locus), ranged between three and twelve percent, which at 
the upper end is large compared to those typically observed for 
quantitative traits in trees [64—66]. Inflated effect sizes for some 
loci could reflect increased variance in environmental values due 
to the undc'rpowered nature of the study owing to the small 
number of populations and individuals sampled [67]. 

The method of Coop was more conservative, possibly owing to 
the small size of the SNP data set and potential for non-neutral loci 
among SNPs applied in estimating the covariance matrix. 
Evidence for correlations between environment and four of six 
SNP loci tested using the Bayesian framework was substantial 
(LogioBayes factor >0.5<1) based on Jeffrey's scale (Table 3). 
The associations summarised in Table 3, those presenting strong 
evidence for local adaptation in multiple tests, are illustrated as a 
function of environmental variation in Figures 4 and S6. 

Genetic structure at adaptive loci. The relationship 
between environmental dissimilarity and hierarchical population 
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Figure 2. Fst estimates for individual SNP loci and whole genes plotted as a function of heterozygosity. Fsr estimates were also plotted 
for the set of 1 5 SSR markers applied in Bucher et al. 2009. Fst estimates for SNPs or genes that sat above the upper or lower boundary of the neutral 
envelope simulated in Arlequin were considered as potential targets of diversifying or homogenising selection respectively. 
dor:l 0.1 371/journal.pone.01 0351 5.g002 



structure inferred from putatively adaptive loci was assessed via a 
partial mantel test, while accounting for demographic signals 
inferred from SSR markers. This approach identified several 
outlier SNP loci, where an allelic relationship with environment 
had been detected, that exhibited genetic structure reflecting 
environmental differences among populations (or isolation by 
environment) (Table 3, Figure 5). In total, dissimilarity matrices 
for five SNP-environment combinations were significant following 
Bonferoni correction (pSO.OOl). The results suggest that the 
selected environments could have driven genetic structure in these 
cases, and supports inferences of adaptive selection at these loci 
from outlier and association tests. Pairwise divergence at SNP58 
from the Pip2 gene is illustrated as a function of dissimilarity in 
climPCAI which was significant both before and after accounting 
for neutral structure (Figure 5). We also observed that differenti- 
ation at this locus was significantiy correlated with genetic 
differentiation for SNP37 (ERECTA) and SNP56 (PIP2) (p< 
0.001), which were similarly correlated with dissimilarity for 
climPCAI (Table 3). 

Discussion 

Natural populations occurring across environmental gradients 
offer opportunities to detect signatures of selection resulting from 
local adaptation, and test specific hypotheses about the spatial, 
environmental and temporal scales over which adaptive evolution 



is exerted [10,11,51]. In trees, studies of adaptive evolution have 
primarily focused on adaptation to long-standing environmental 
cfines because of their long generation times [10]. Here we 
examined genetic diversity and evidence of adaptation at 
candidate gene- SNP loci sampled in em-ironmentally contrasted 
natural populations of Eucalyptus camaldulensis across the species 
range. These populations are differentiated for climate, primarily 
reflecting variation in rainfall, evaporation, temperature and 
sunlight which potentially drive differences in water availability, 
a strong driver of adaptation in diverse plant species [68,69]. The 
populations were also variable for a number of ecological indices, 
soil and geographic features which could act as selective 
constraints [70,71]. Throughout the early to mid Holocene 
conditions over most of Australia were wetter than the present 
day, and climate records suggest the onset of modern conditions 
from around 4000 years ago [72,73]. We therefore expect that 
differences in climate among populations inferred from modern 
instrumental recordings have prevailed sufficientiy long to provide 
opportunities for local adaptation over multiple generations. 

Local adaptation among provenances and populations of E. 
camaldulensis has previously been suggested from variation in 
adaptive phenotypes which correspond to local environment 
including: morphological traits (growth form, leaf thickness, 
stomatal density, depth of root system, root to shoot ratio and 
phenology) [38,74—77]; growth rate (height, diameter) [78]; wood 
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properties (density, shrinkage, fibre length) [79,80]; physiological 
responses (water use efficiency, stomatal conductance, CO2 
assimilation) [81-83]; and drought toUerance [76,81,82,84,85]. 
The adaptive clines suggested from phenotypic variation are 
supported by recent evidence of genetic adaptation within coding 
genes [33,41,42]. In the present study we surveyed a modest 
number of candidate gene SNP loci in a broad sample of 
environmentally contrasted E. camaldulemis provenances approx- 
imating the species natural distribution. This approach provided 
for the first time a coarse indication of the distribution of 
potentially adaptive variation in this species at a broad landscape 
scale. 

Adaptive variation in candidate genes 

We identified six putative examples of adaptive evolution based 
on variation in SNP allele frequencies among populations and co- 
variation with environment (Table 3). These SNP loci, located in 
four genes, were significant in outlier tests and also were 
significandy associated with environment in MatSAM and one 
or both of Tassel and the MCMC method of Coop et al. [59]. 
Additionally, partial mantel tests revealed several loci where 
population genetic relationships were shown to be correlated with 
environmental dissimilarity among populations, suggestive of local 
adaptation. Of the six SNP loci identified as possible targets of 
diversifying selection, the majority (83yo) reside in genes predicted 
to have direct roles in plant water relations (e.g. PIP2, Dehydrin 
and ERECTA). The over representation of outlier loci from this 
functional class is perhaps compelling given SNPs within "water 
use" genes accounted for only 25% of the data set. This suggests 
that genes directiy impacting water movement or dehydration 
response for example may be more likely to be subject to 



adaptation in populations where climatically driven water 
availability is contrasted compared to genes with other functions, 
such as structural cell wall genes. This is consistent with studies 
identifying aquaporins and dehydrins as candidates for adaptive 
evolution in response to water availability [22,28,86-88]. Erecta 
has previously been shown to regulate transpiration efficiency in 
Arahidnpsis [4,5], and has been linked with drought adaptation in 
provenances of Populus nigra (Viger, unpublished data). In a 
recent gene expression study in E. camaldulensis both PIP2 and 
ERECTA were found to be difiFerentiaJIy expressed (down 
regulated) in droughted compared to well watered E. camaldu- 
lensis plants [42] . This points to the importance of two of these 
genes in response to drought and as possible targets of selection for 
adaptation in natural populations. 

Environmental associations 

Associations with environment provided support for local 
adaptation to explain allele frequency variation among popula- 
tions at outlier loci. Significant co-variation of average expected 
heterozygosity (He) for outliers and climate initially suggested this 
(Figure 3; Figure S5). The tendency for higher diversity at hotter, 
drier sites could potentially reflect a clinal shift in balancing 
selection favouring heterosis [89,90], however this explanation is 
not supported by observed numbers of heterozygotes, and the 
trend is more likely to coincide with directional selection of alleles 
along the environmental chne. Signfficant association of allele 
frequency and genotype with environment, mainly climate 
(cumPCAI), was subsequendy confirmed for several outlier loci 
using three different methods while accounting for demographic 
effects (Table 3). The results suggest that variation in climate, 
specifically temperature and evaporation is potentially important 
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Figure 4. Minor allele frequency plotted as a function of environment (climPCAI) for two outlier loci (top), for SNP 58 (C allele 
plotted, C/T SNP) (PIP2) and SNP 37 (G allele plotted, G/T SNP) (ERECTA); (middle) significant genotypic associations are illustrated 
as box plots for climPCAI as a function of genotype, and (bottom) in each case the map illustrates spatial and environmental 
structuring of allelic diversity, potentially reflecting local adaptation at these loci. The R^ and p-values displayed on each plot represent 
the proportion of variance in environmental parameters explained by the SNP maker and significance of the observed relationship. 
doi:1 0.1 371 /journal.pone.01 0351 5.g004 



as a driver of adaptation in this gene set, and to a lesser extent 
rainfall, species richness and soil type. 

The best supported cases for selection are illustrated by two 
outlier SNP loci, namely SNP37 (G/T) from Erecta (a putative 
leucine-rich repeat receptor-like kinase) and SNP58 (C/T) from 
PIP2 (an aquaporin), which were both associated with climate 
(glimPCAI) and exhibited significant adaptive structure (Figure 4; 
Table 3). These loci exhibit similar clines in allele frequency with 
respect to climPCAI which could indicate positive directional 
selection. Low levels of linkage disequilibrium (LD) in natural 
populations of river red gum [41] could mean that selection may 



be acting on these loci directly, or on a closely linked locus. The 
frequency of the minor (C) allele of SNP58 is decreased in 
populations where both temperature and evaporation is highest 
(e.g. more positive values of climPCAI). Tests of association 
indicate this cline in allele frequency reflects a greater proportion 
of T:T homozygotes at this locus in the driest populations. The 
heterozygote at this locus is associated with climates intermediate 
to the two homozygotes, suggesting an additive mode of gene 
action on an unknown adaptive trait. Similarly for ERECTA, an 
increase in the minor (G) allele frequency was associated with 
reduced values of climPCAI (wetter, more mesic, conditions). The 
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Figure 5. Covariance of pairwise genetic differentiation (Fst) fof SNP58 (PIP2 gene) and environment may be indicative of adaptive 
genetic structure, as suggested by pairwise population Fst for SNP 58 visualised as a function of (a pairwise population 
dissimilarity for climPCAl, and b) pairwise population dissimilarity for climPCAl adjusted for demographic effects on genetic 
diversity, where both variables are plotted as the residuals of their linear model with population pairwise Fst based on 15 
putatively neutral SSR loci [40]. Population pairwise Fst for SNP58 exhibited significant covariance with estimates for two putatively adaptive loci 
c) SNP37 (ERECTA), and d) SNP56 (PIP2). Correlation coefficients (R squared) and p values for Mantel (a, c, d) and partial Mantel (b) tests are presented 
in each case. 
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cline in allele frequency at this locus reflects a greater proportion of 
T:T homozygotes in the driest populations, and both the 
heterozygote G:T and homozygote G:G are associated with lower 
temperature and evaporation, suggesting a dominant mode of 
gene action (Figure 4). Both of these loci are silent, occurring in 
intronic .sequence and do not code a change in the predicted 
protein product. It is possible that the target of selection is a closely 
linked locus which is amino acid changing, however it is also 
feasible that these silent mutations could be functional variants 
which are themselves under selection. Examples of functionality of 
non-coding polymorphisms, including cis-acting regulatory ele- 
ments, have been observed in eucalypts and other species [9 1 ,92] . 



The relative grouping of populations based on allele frequency 
and environment for the two loci in Figure 4 suggests relationships 
that are congruent with geographic and neutral population 
structure. The eight populations with low values of climPCAI 
and high frequency of the C allele for SNP58 belong to either the 
Murray-Darling Basin or Spencer Gulf provenances. A similar 
pattern is observed for SNP37. While it is possible that 
autocorrelation of climate and population demography in this 
species [40] has increased the risk of false association, we have 
been careful to account for neutral population structure in these 
analyses. The bias towards water use genes among associated 
SNPs also suggests a functional basis to the co-variation with 
environment. These results implore the use of multiple comple- 
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mentary approaches and careful consideration of potentially 
confounding population structure in studies aiming to differentiate 
between allelic variation arising from adaptation and neutral 
demographic processes in this species. 

Genetic divergence (Fst) among populations for several puta- 
tively adaptive loci was related to population level dissimilarity for 
climPCAI and geogPCA2 (Table 3). The results suggest that the 
specific environments loading to each multivariate parameter could 
have constrained genetic relationships among populations at these 
loci, and provide support for inference of local adaptation from 
outlier and association tests. Similarities in the inferred genetic 
relationships were observed for some loci. For example, pairwise 
population genetic divergence estimated for SNP58 (PIP2), SNP37 
(ERECTA) and SNP56 (PIP2) was significantly correlated, and in 
each case the hierarchical relationships co-varied with dissimilarity 
in climPCAI (Table 3, Figure 5). Given low linkage disequilibrium 
(LD) between SNP37 and SNP56 (R^ = 0.008) co-variation of their 
inferred population genetic structure could point to concerted 
selection acting upon unlinked loci, both in genes influencing plant 
water relations, in response to a common selection pressure [60]. 
Conversely, the correlation in divergence patterns with SNP58 and 
SNP.'iG, which reside in the PIP2 gene, more likely reflects linkage 
over short physical distances (R^ - 0.41). 

Conclusions 

Inference of local adaptation in E. camaldulensis based on these 
results is limited by the small number of loci and individuals 
examined; however they draw our attention to the potential for 
further studies of adaptive variation in this species, and suggest 
that selection in response to climate has driven genetic differences 
among populations at the landscape scale. With the generation of 
genome wide SNP datasets which partition adaptive and neutral 
genetic variation there arises opportunity for application of genetic 
markers for the management of forest resources in the face of 
climate change. This could include monitoring populations for 
evidence of, or assessing potential for, genetic adaptation by 
measuring standing genetic diversity and screening adaptively 
important variants in populations under threat [7,15,93,94]. 
Linking SNP diversity at putatively adaptive loci with phenotypic 
variation via association studies achieves an important validation 
of adaptive variants identified in population genetic studies, and 
provides a tangible mechanism by which managers can assess 
adaptive phenotypes in natural and planted forests. In E. 
camaldulensis, interrogation of larger SNP data sets at the 
landscape scale, complemented by genotype-phenotype associa- 
tion studies under different environments should be the next steps 
towards generating data sets which could be applied to these ends. 

Supporting Information 

Figure SI Latitudinal chnes were obseved for each of the six 
principal components derrived from environmental variables. 

Latitude (deg.) is plotted on the x-axis and PCA casewise scores for 

populations on the y-axis in each case. 

(TIF) 

Figure S2 Mantel correlation of pairwise population Fst 
estimated on all SNP loci as compared to the 15 nuSSR loci 
from Butcher et al 2009 (R^ = 0.27, p<0.001). 
(TIF) 

Figure S3 Genetic relationships among populations, inferred 
from principal coordinate analysis (PCoA) for (a) 59 SNP and (b) 
15 nuSSR markers which indicate grouping by sub species: subsp. 



minima (•), subsp. obtusa (o), subsp. arida ( ^ ), subsp. refulgens 

(A), subsp. simulata (O) and subsp. camaldulensis (A). 
(TIF) 

Figure S4 Mantel correlation of pairwise population Fst 
estimated from ANOVA variance components on all SNP loci as 
compared to physical distance between populations in kilometres 
(km) identified significant isolation by distance (R^ = 0.36, p<0.001). 
(TIF) 

Figure S5 Heterozygosity (y-axis) estimated within populations 
for outlier genes plotted as a function of environment (climPCAI) 
for: a) Dehydrin (R2 = 0.44; p<0.001), b) PIP2 (R2 = 0.39; p< 
0.003), c) COMT (R2 = 0.45; p<0.001). 

(TIF) 

Figure S6 Variation in population level allele frequency (x-axis) 
for the six outlier SNP loci presented in Table 3 and principal 
components derived from environmental variables (y-axis) which 
were significandy associated. 

(TIF) 

Table SI Mean annual estaimtes for environmental variables 
apphed in principal component analyses. 

(DOCX) 

Table S2 Correlations (loading) between environmental vari- 
ables and principal components. 

(DOCX) 

Table S3 Summary of the 59 SNP loci used in this study. Minor 
allele frequency (MAF) was determined over all populations. 
Amino acid abbreviations according to lUPAC conventions. 
(DOCX) 

Table S4 Biallelic genotype calls for all SNP loci screened across 
each of the 191 £. camaldulensis individuals used in this study. 
(CSV) 

Table S5 Whole genes and SNP loci identified as having 
divergence more extreme than expected when compared to the 
neutral distribution simualted in Arlequin. 

(DOCX) 

Table S6 SNP loci for which a model including selection was 
supported following analyses with BayeScan. 

(DOCX) 

Table S7 SNP alleles exhibiting significant covariation (p£0.05) 
with environment following bonferroni correction for both the 
Wald and likelihood ratio test implemented in SAM. 

(DOCX) 

Table S8 Significant assocaitions (pS0.05) identified with 
component environemntal variables following adjustment for 
popualtion structure and multiple testing (permutation) in Tassel. 
(DOCX) 

Acknowledgments 

The authors would like lo lliank Penny Bulchcr for providing E. 
camaldulensis DNA accessions from Across Australia for use in this study. 

Author Contributions 

Conceived and designed the experiments: SD RM DSB GNR YP SGS. 
Performed the experiments: SD RM DSB (JNR YP SCiS. Analyzed the 
data: SD RM. Contributed reagents/materials/analysis tools: SD DSB 
GNR YP SGS. Contributed to the writing of the manuscript: SD RM 
DSB. 



PLOS ONE I www.plosone.org 



13 



August 2014 I Volume 9 | Issue 8 | e103515 



Environmental Adaptation of the Australian River Red Gum 



References 

1. Allen CD (2009) Climate-induced forest diehack: an esealating global 
phenomenon? Unasylva 231/232 60: 43-49. 

2. CarniccrJ, Coll M, Ninycrola M, Pons X, Sanchez G, ct al. (201 1) Widespread 
crown condition decline, food web disruption, and amplified tree mortality with 
increased climate change-type drought. Proc Natl Acad Sci USA 108: 1474— 
1478. 

3. Hanna P, Kulakowski D (20 1 2) The influences of climate on aspen dieback. For 
Ecol Manag 274: 91-98. 

4. Lindner M, Maroschek M, Netherer S, Kremer A, Barbati A, et al. (2010) 
Climate change impacts, adaptive capacity, and vulnerability of European 
forest ecosystems. For Ecol Manag 259: 698-709. 

5. Aitkcn SN, Whitlock MC (2013) Assisted Gene Flow to Facilitate Local 
Adaptation to (..limatc Change. Annual Review of Ecology, Evolution, and 
Systcmatics 44: 367-388. 

6. Barrett RDH, Schluter D (2008) Adaptation from standing genetic variation. 
Trends Ecol Evol 23: 38^4. 

7. Hansen MM, Olivieri I, WaUer DM, Nielsen EE, Ge MWG (2012) Monitoring 
adaptive genetic responses to environmental change. Mol Ecol 21: 131 1—1329. 

8. Sgro CM, Lowe AJ, Hof&nann AA {201 1) Building evolutionary resilience for 
conserving biodiversity under climate change. Evolutionary AppHcations 4: 
326-337. 

9. Alberto FJ, Aitken SN, Alia R, Gonzalez-Martinez SC, Hannincn H, et al. 
(2013) Potential for evolutionary responses to climate change evidence from 
tree populations. (Global Change Biologx' 19: 1645-1661. 

10. Neale DB, Kremer A (2011) Forest tree genomics: growing resources and 
applications. Nat Rev (ienet 12: 111-122. 

1 1. Sork VL, Aitkcn SN, Dyer RJ, Eckcrt AJ, Lcgendre P, ct al. (2013) Putting the 
landscape into the genomics of trees: approaches for understanding local 
adaptation and population responses to changing climate. Tree Genetics & 
Genomes 9: 901-911. 

12. Kremer A, Ronce O, Robledo-Amuncio JJ, GuiUaume F, Bohrer G, et al. 
(2012) Long-distance gene flow and adaptation of forest trees to rapid climate 
change. Ecol Lett 15: 378-392. 

13. Aitken SN, Yeaman S, HoUiday JA, Wang TL, Curtis-McLane S (2008) 
Adaptation, mii;ralioi: i>r extirpation: climate change outcomes for tree 
populations. Evolulionarv Applications 1: 95-111. 

14. Parmesan C (2006) Ecological and evolutionary responses to recent climate 
change. Annual Review of Ecology Evolution and Systcmatics. pp. 637—669. 

15. Hoffinann AA, Sgro CM (2011) Climate change and evolutionary adaptation. 
Nature 470: 479-485. 

16. Hof&nann AA, Willi Y (2008) Detecting genetic responses to environmental 
change. Nat Rev Genet 9: 421^32. 

17. Lynch M (2007) The frailty of adaptive hypotheses for the origins of organismal 
complexity. Proc Nad Acad Sci USA 104: 8597-8604. 

18. Reusch TBH, Ehlers A, Hammerh A, Worm B (2005) Ecosystem recovery after 
climatic extremes enhanced by genotypic diversity. Proc Nati Acad Sci USA 
102: 2826-2831. 

19. Schaljerg PC, DcHayes DH, Hawley G], Nijensohn SE (2008) Anthropogenic 
alterations of genetic diversity within tree po];)ulaUons: Implications for forest 
ecosystem resilience. For Ecol Manag 256: 855—862. 

20. Savolainen O, Pyhajarvi T, Knurr 1' (2007) Gene flow and local adaptation in 
trees. Annual Review of Ecology Evolution and Systcmatics 38: 595-619. 

21. Ingvarsson PK (2005) Nucleotide polymorphism and linkage disequilbrium 
within and among natural populations of European Aspen (Populus tremula L., 
Salicaeeae). (ieneties 169: 945-953. 

22. Eveno E, (?.ollada C, (i-uevara MA, Legcr V, Soto A, et al. (2008) Contrasting 
patterns of selection at Pinus pinaster Ait. drought stress candidate genes as 
revealed by genetic differentiation analyses. Mol Biol Evol 25: 417-437. 

23. Namroud MC, Beaulieu J, Jtige X, Laroehe J, Bousquet J (2008) Scanning the 
genome for gene single nucleotide polymorphisms involved in adaptive 
population differentiation in white spruce. Mol Ecol 17: 3599-3613. 

24. Eckert AJ, Bower AD, Gonzalez-Martinez SC, Wegrzyn JL, Coop G, et al. 
(2010) Back to nature: ecological genomics of lobloUy pine (Pinus taeda, 
Pinaceae). Mol Ecol 19: 3789-3805. 

25. Eckert AJ, Bower AD, Wegrzyn JL, Pande B, Jermstad KD, et al. (2009) 
Asssociation Genetics of Coastal Douglas Fir (Pseudotsuga menziesu var. 
menziesii, Pinaeeae). I. Gold-Hardiness Related Traits, (genetics 182: 1289- 
1302. 

26. Eekert AJ, van Heerwaarden J, Wegrzyn JL, Nelson CD, Ross-Ibarra J, et al. 
(2010) Patterns of Population Structure and Environmental Assoeialions to 
Aridity Across the Range of Loblolly Pine (Pinus taeda L., Pinaceae). (jcnetics 
185: 969-982. 

27. Eckert AJ, Wegrzyn JL, Pande B, Jermstad KD, Lee JM, et al. (2009) 
Multilocus Patterns of Nucleotide Diversity and Divergence Reveal Positive 
Selection at Candidate Genes Related to Cold Hardiness in Coastal Douglas 
Fir (Pseudotsuga menziesii var. menziesii). Genetics 183: 289—298. 

28. Wachowiak W, Balk PA, Savolainen () (2009) Search for nucleotide diversity 
patterns of local adaptation in dehydrins and other cold-related candidate 
genes in Scots pine {Pinus sylvestris L.). Tree Genetics & Genomes 5: 1 17— 132. 



29. HoUiday JA, Ritland K, Aitken SN (2010) Widespread, ecologicaUy relevant 
genetic markers developed from association mapping of climate-related traits in 
Sitka spruce (Picea sitchensis). New Phytol 188: 501-514. 

30. PrunierJ, Laroehe J, BeauheuJ, BousquetJ (2011) Scanning the genome for 
gene SNPs related to climate adaptation and estimating selection at the 
molecular level in boreal black spruce. Mol Ecol 20: 1702-1716. 

31. Bradbury D, Smithson A, Krauss SL (2013) Signatures of diversifying selection 
at EST-SSR loci and association with climate in natural Eucalyptus 
populations. Mol Ecol 22: 5112-5129. 

32. Dillon SK, Nolan MF, Matter P, Gapare WJ, BraggJG, et al. (2013) Signatures 
of adaptation and genetic structure among the mainland populations of Pinus 
radiata (D. Don) inferred from SNP loci. Tree Genetics & Genomes 9: 1447— 
1463. 

33. Bragg JG, DiUon SK, Soutiierton SG, Young AG (2014) Genome-wide 
signatures of molecular adaptation in environmentaUy contrasted populations 
of Eucalyptus camaldulensis, a riparian tree. New Phytol pending review. 

34. Brookcr MIH, Kleinig DA (1999) Filed Guide to Eucalypts. Hawthorn 
Bloomings Books. 

35. Cunningham SC, Mac NaUy R, Read J, Baker PJ, White M, et al. (2009) A 
Robust I'eehnique for Mapping Vegetation Condition Across a Major River 
System. Ecosystems 12: 207-219. 

36. Cunningham SC, Thomson JR, Read J, Baker PJ, Mac NaUy R (2010) Does 
stand structure influence susceptibility of eucalypt floodplain forests to dieback? 
Austral Ecol 35: 348-356. 

37. Wen L, Ling J, Saintilan N, Rogers K (2009) An investigation of the 
hydrological requirements of River Red Gum (Eucalyptus camaldulensis) 
Forest, using Classification and Regression Tree modelling. Ecohydrology 2: 
143-155. 

38. Davidson NJ, Reid JB (1989) Response of eucalypt species to drought. 
AustJ Ecol 14: 139-156. 

39. Gibson A, Bachelard EP, Hubick KT (1995) Relationship between climate and 
provengmce variation in eucalyptus-camaldulensis dehnh. AustJ Plant Physiol 
22: 453^60. 

40. Butcher PA, McDonald MW, Bell JC (2009) Congruence between environ- 
mental parameters, morphology and genetic structure in Australia's most 
widely distributed eucalypt, Eucalyptus camaldulensis. Tree Genetics & 
Genomes 5: 189-210. 

41. Kulheim C, Yeoh SH, Maintz J, Foley WJ, Moran GF (2009) Comparative 
SNP diversity among four Eucalyptus species for genes from secondary 
metabolite biosynthetic pathways. BMC Genomics 10: 1—11. 

42. Thumma BR, Sharma N, Southerton SG (2012) Transcriptome sequencing of 
Eucalyptus camaldulensis seedlings subjected to water stress reveals functiouEil 
single nucleotide polymorphisms and genes under selection. BMC Genomics 
under review. 

43. Doyle JJ, Doyle JL (1990) Isolation of plant DNA from fresh tissue. Focus 12: 
13-15. 

44. KaldenhoffR, Fischer M (2006) Aquaporins in plants. Acta Physiologica 187: 
169-176. 

45. Masle J, Gihnore SR, Farquhar GD (2005) The ERECTA gene regulates plant 
transpiration efficiency in Arabidopsis. Nature 436: 866-870. 

46. Rorat T (2007) Plant dehydrins - Tissue location, structure and function (vol 
11, pg 536, 2006). Cell Mol Biol Lett 12: 148-148. 

47. Chen ZZ, Hong XH, Zhang HR. Wang YQ, Li X, et al. (2005) Disruption of 
the cellulose synthase gene, AtC^csAH/lRXl, enhances drought and osmotic 
stress tolerance in Arabidopsis. Plant J 43: 273—283. 

48. Hall TA (1999) BioEdit: a user fi^endly biological sequence alignment editor 
and analysis program for Windows 95/98/NT. Nucleic Acids Symposium 
Series 41. 

49. Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, et al. (2007) 
TASSEL: software for association mapping of complex traits in diverse 
samples. Bioinformatics 23: 2633-2635. 

50. Peakall R, Smouse PE (2006) GENALEX 6: genetic analysis in Excel. 
Population genetic software for teaching and research. Mol Ecol Notes 6: 288- 
295. 

51. Luikart G, England PR, Tallmon D, Jordan S, Taberlet P (2003) The power 
and promise of population genomics: From genotyping to genome typing. Nat 
Rev Genet 4: 981-994. 

52. Excoffier L, Hofer T, Foil M (2009) Detecting loci under selection in a 
hierarehieally structured population. Heredity 103: 285—298. 

53. Beaumont MA, Nichols RA (1996) Evaluating loci for use in the genetic 
analysis of population structure. Proceedings of the Royal Societ\' of London 
263: 1619-1626. 

54. Storey JD, I'ibshirani R (2003) Statistical significance for genomewide studies. 
Proceedings of the National Academy of Sciences USA 100: 9440-9445. 

55. Foil M, Gaggiotti O (2008) A Genome-Scan Method to Identify Selected Loci 
Appropriate for Both Dominant and Godominant Markers: A Bayesian 
Perspective. Genetics 180: 977-993. 

56. Jeffreys H (1961) The Theor\' of Probability. Oxford press. 

57. Joost S, Bonin A, Bruford MW, Despres L, Conord C, et al. (2007) A spatial 
analysis method (SAM) to detect candidate loci for selection: towards a 
landscape genomics approach to adaptation. Mol Ecol 16: 3955—3969. 



PLOS ONE I www.plosone.org 



14 



August 2014 I Volume 9 | Issue 8 | e103515 



Environmental Adaptation of the Australian River Red Gum 



58. Henderson GR (1975) Best linear unbiased estimation and prediction under a 
selection model. Biometrics 31: 423—447. 

59. Coop G, Witonsky D, Di Rienzo A, Pritehard JK (2010) Using Environmental 
Correlations to Identify Loci Underlying Local Adaptation. Genetics 185: 
1411-1423. 

60. Orsini L, Vanoverbeke J, Swillen I, MergeayJ, De Meester L (2013) Drivers of 
population genetic differentiation in the wild: isolation by dispersal limitation, 
isolation by adaptation and isolation by colonization. Mol Eeol 22: 5983 5999. 

61. (ioslee SG, Urban DL (2007) The ccodist package for dissimilarity-based 
analysis of ecological data. Journal of Statistical Software 22: 1-19. 

62. Excofficr L, Laval G, Schneider S (2005) Arlcquin vcr. 3.0: An integrated 
software package for population genetics data analysis. Evolutionary Bioinfor- 
matics Online 1. 

63. De Mita S, ThuiUet AG, Gay L, Ahmadi N, Manel S, et al. (2013) Detecting 
selection along environmental gradients: analysis of eight methods and their 
effectiveness for outbreeding and selfmg populations. Mol Ecol 22: 1 383 -1399. 

64. Gonzalez-Martinez SG, Huber D, Ersoz E, Davis JM, Xcale DB (2008) 
Association genetics in Pinus taeda L. II. Garbon isotope discrimination. 
Heredity 101: 19-26. 

65. Kulhcim G, Yeoh SH, WalUs IR, Laifan S, Moran GF, et al. (2011) The 
molecular basis of quantitative variation in foliar secondary metabolites in 
Eucalyptus globulus. New Phytol 191: 1041-1053. 

66. Dillon SK, BrawnerJT, Mcder R, Lee DJ, Southcrton SG (2012) Association 
genetics in Gorymbia citriodora subsp variegata identifies single nucleotide 
polymorphisms affecting wood growth and eellulosic pulp yield. New Phytol 
195: 596-608. 

67. loannidis JPA (2008) Why most discovered true associations are inllated. 
Epidemiology 19: 640-648. 

68. Paccard A, Vance M, WiUi Y (2013) Weak impact of fine-scale landscape 
heterogeneity on evolutionary potential in Arabidopsis lyrata. J Evol Biol 26: 
2331-2340. 

69. Ward D, Shrcstha MK, Golan-Goldhirsh A (20 1 2) Evolution and eeolog\' meet 
molecular genetics: adaptive phenotypic plasticity in two isolated Negev desert 
populations of Acacia raddiana at either end of a rainfall gradient. Ann Bot 
109: 247-255. 

70. Millar GI, Westf^l RD, Delany DL, Bokach MJ, Flint AL, et al. (2012) Forest 
mortality in high-elevation whitebark pine (Pinus albicaulis) forests of eastern 
California, USA; influence of environmental context, bark beetles, climatic 
water deficit, and warming. Canadian Journal of Forest Research-Revue 
Canadienne De Recherche Forestiere 42: 749-765. 

71. Mosca E, Gonzalez-Martinez SC, Neale DB (2014) Environmental versus 
geographical determinants of genetic structure in two subalpine conifers. New 
Phytol 201: 180-192. 

72. Allen RJ (1985) The Australasian summer monsoon, teleeonnections, and 
flooding in the Lake Eyre Basin by Robert J.Allan Royal Geographical Society 
Of Australasia. 47 p. 

73. Pedierick L, Bostock H, Cohen TJ, Fitzsimmons K, TibbyJ, et al. (2013) 
Climatic records over the past 30 ka fi-om temperate Austrjilia - a synthesis 
from the Oz-INTIMATE workgroup. Quaternary Science Reviews 74: 58-77. 

74. Awe JO, Shepherd KR, Florence RG (1976) Root development in provenances 
oi Eucalyptus camaldidensis Dehnh. Aust For 39: 201-209. 

75. Gibson A, Bachelard EP (1994) Relationships between site characteristics and 
sur^dval strategies of eucalyptus-eamaldulensis seedlings. 91-95 p. 

76. Gibson A, Bachelard E, Hubick K (1995) Relationship Between Climate and 
Provenance Variation in Eucalyptus Camaldulensis Dehnh. Funct Plant Biol 
22: 453^60. 

77. James S, Bell D (1995) Morphology and Anatomy of Leaves of Eucalyptus 
camaldulensis Clones: Variation Between Geographically Separated Locations. 

Aust J Bot 43: 415-433. 

78. Otegbeye GO (1985) Provenance productivity in eucalyptus camaldulensis and 
its implications to genetic improvement in the savanna region of nigeria. Silvae 
Genet: 121-126. 

79. SESBOU A, NEPVEU G (1978) Variabilite infraspcciliquc du retrait avec 
collapse et de la densite du bois chez Eucalyptus camaldulensis. Ann Sci forest 
35: 237-263. 

80. El-Lakany MH, El-Osta ML, Badran AO (1980) Evaluation of newly 
introduced Eucalyptus camaldiflensis provenances in Egypt. Alexandria 
Jounral of ^icutural Research 28: 309-319. 



81. Morshet S (1981) Physiological Activity, in a Scmiarid Environment, of 
Eucalyptus camaldulensis Dehn. from Two Provenances. Aust J Bot 29: 97— 
110. 

82. Gibson A, Hubick K, Bachelard E (1991) Effects of Abscisic Acid on 
Morphological and Physiological Responses to Water Stress in Eucalyptus 
camaldulensis Seedlings. Funct Plant Biol 18: 153-163. 

83. Hubick KTaAG (1993) Diversity in the relationship between carbon isotope 
discrimination and transpiration efficiency when water is limited. In: J. R 
Ehleringer, A. E Hall, G. D Farquhar, editors. Stable Isotopes and Plant 
Carbon Water Relations. New York: demie Press, pp. 311. 

84. (irunwald G, Karschon R (1983) Variation of eucalyptus-eamaldulensis from 
north-australia grown in Israel. Silvae Genet 32: 165-173. 

85. Lcmcoff JH, Guarnaschelh AB, Garau AM, Prystupa P (2002) Elastic and 
osmotic adjustments in rooted cuttings of several clones of Eucalyptus 
camaldulensis Dehnh. from southeastern Australia after a drought. Flora - 
Morpholo_g\'. Distribution, Functional Eeolooy of Plants 197: 134-142. 

86. Aiidigeos D, Buonamici A. Belkadi L, R\mrr P, Boshicr D. et al. (2010) 
Aquaporins in the wild: natural genetic divcrsitv and selective pressure in the 
PIP gene family in live Neotropical tree species. BMG Evol Biol 10. 

87. Maurel G, Verdoucq L, Luu DT, Santom V (2008) Plant aquaporins: 
Membrane channels with multiple integrated fiinctions. Annu Rev Plant 
Biol.pp. 595-624. 

88. Xia H, Camus-Kulandaivelu L, Stephan W, Tellier A, Zhang ZW (2010) 
Nucleotide diversity patterns of local adaptation at drought-related candidate 
genes in wild tomatoes. Mol Ecol 19: 4144-4154. 

89. Hao GP, Zhang XH, Wang YQ, Wu ZY, Huang CL (2009) Nucleotide 
Variation in the NCED3 Region of Arabidopsis thaHana and its Association 
Study with Abscisic Acid Content under Drought Stress. J Integr Plant Biol 51: 
175-183. 

90. Philippe R, Courtois B, McNally KL, Mournet P, El-Malki R, et al. (2010) 
Structure, allelic diversity and selection of Asr genes, candidate for drought 
tolerance, in Oryza sativa L. and wild relatives. Theor Appl Genet 121: 769- 
787. 

91. Thumma BR, Matiieson BA, Zhang DQ, Meeske C, Meder R, et al. (2009) 
Identification of a Cis-Acting Regulatory Polymorphism in a Eucalypt 
COBRA-Like Gene Affecting Cellulose Content. Genetics 183: 1153-1164. 

92. Salvi S, Sponza G, Morgante M, Tomes D, Niu X, et al. (2007) Conserved 
noneoding genomic sequences associated with a flowering-time quantitative 
trait locus m maize. Proe Nad Acad Sei U S A 104: 1 1376-1 1381. 

93. Schueler S, Kapeller S, Konrad H, Geburek T, Mengl M, et al. (2013) 
Adaptive genetic diversity of trees for forest conservation in a future climate: a 
case study on Norway spruce in Austria. Biodivers Gonscrv 22: 1151—1166. 

94. van Zonneveld M, Scheldeman X, Escribano P, Viruel MA, Van Damme P, et 
al. (2012) Mapping Genetic Diversity of Cherimoya (Annona cherimola MiU.): 
Application of Spatial Analysis for Conservation and Use of Plant Genetic 
Resources. Plos One 7. 

95. McDonald MW, Brooker MIH, Butcher PA (2009) A taxonomie revision of 
Eucalyptus camaldulensis (Myrtaceae). Aust Syst Bot 22: 257-285. 

96. Li EG, Lu SF, Chiang V (2006) A genomic and molecular view of wood 
formation. Grit Rev Plant Sci 25: 215-233. 

97. Doblin MS, Kurck I, Jacob-Wilk D, Delmcr DP (2002) CeUulose biosynthesis 
in plants: from genes to rosettes. Plant CeU Physiol 43: 1407-1420. 

98. Brown DM, Zeef LAH, Ellis J, Goodacre R, Turner SR (2005) Identification of 
novel genes in Arabidopsis involved in secondary cell wall formation using 
expression profiling and reverse genetics. Plant CeU 17: 2281—2295. 

99. Beck EH, Fettig S, Knake C, Hartig K, Bhattarai T (2007) Specific and 
unspecific responses of plants to cold and drought stress. Journal of Biosciences 
32: 501-510. 

100. Szyjanowicz PMJ, McKinnon I, Taylor NG, Gardiner J, Jarvis MG, et al. 
(2004) The irregular xylem 2 mutant is an allele of korrigan that affects the 
secondary cell wall of Arabidopsis thaliana. PlantJ 37: 730-740. 

101. Borevitz JO, Xia YJ, Blount J, Dixon RA, Lamb G (2000) Activation tagging 
identifies a conserved MYB regulator of phenylpropanoid biosynthesis. Plant 
CeU 12: 2383-2393. 

102. Baima S, Possenti M, Matteucci A, Wisman E, Altamura MM, et al. (2001) 
The Arabidopsis ATHB-8 HD-zip protein acts as a differentiation-promoting 
transcription factor of the vascular meristems. Plant Physiol 126: 643-655. 



PLOS ONE I www.plosone.org 



15 



August 2014 I Volume 9 | Issue 8 | e103515 



